
    Reliability of brain atrophy measurements in multiple sclerosis using MRI: an assessment of six freely available software packages for cross-sectional analyses

    PURPOSE: Volume measurement using MRI is important to assess brain atrophy in multiple sclerosis (MS). However, differences between scanners, acquisition protocols, and analysis software introduce unwanted variability in measured volumes. To quantify these effects, we compared within-scanner repeatability and between-scanner reproducibility of three different MR scanners for six brain segmentation methods. METHODS: Twenty-one people with MS underwent scanning and rescanning on three 3 T MR scanners (GE MR750, Philips Ingenuity, Toshiba Vantage Titan) to obtain 3D T1-weighted images. FreeSurfer, FSL, SAMSEG, FastSurfer, CAT12, and SynthSeg were used to quantify brain, white matter, and (deep) gray matter volumes from both lesion-filled and non-lesion-filled 3D T1-weighted images. We used the intraclass correlation coefficient (ICC) to quantify agreement, repeated-measures ANOVA to analyze systematic differences, and variance component analysis to quantify the standard error of measurement (SEM) and smallest detectable change (SDC). RESULTS: For all six software packages, both between-scanner agreement (ICC range: 0.4–1) and within-scanner agreement (ICC range: 0.6–1) were typically good, and good to excellent (ICC > 0.7) for large structures. No clear differences were found between lesion-filled and non-filled images. However, gray and white matter volumes did differ systematically between scanners for all software packages (p < 0.05). Variance component analysis yielded a within-scanner SDC ranging from 1.02% (SAMSEG, whole brain) to 14.55% (FreeSurfer, CSF), and a between-scanner SDC ranging from 4.83% (SynthSeg, thalamus) to 29.25% (CAT12, thalamus). CONCLUSION: Volume measurements of brain, gray matter, and white matter showed high repeatability, and high reproducibility despite substantial differences between scanners. The smallest detectable change was high, especially between different scanners, which hampers the clinical implementation of atrophy measurements.
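    To make the agreement statistics above concrete, the sketch below computes ICC(2,1), the SEM, and the SDC from a small subjects-by-measurements matrix of volumes. It is a minimal illustration with made-up scan-rescan values and a textbook two-way ANOVA decomposition, not the study's actual analysis pipeline.

```python
# Minimal sketch (illustrative, not the authors' pipeline): ICC(2,1), SEM and SDC
# from a subjects x repeated-measurements matrix of volumes, e.g. scan/rescan data.
import numpy as np

def icc_sem_sdc(y):
    """y: (n_subjects, k_measurements) array of volumes (e.g. mL)."""
    n, k = y.shape
    grand = y.mean()
    row_means = y.mean(axis=1, keepdims=True)
    col_means = y.mean(axis=0, keepdims=True)
    ms_rows = k * np.sum((row_means - grand) ** 2) / (n - 1)
    ms_cols = n * np.sum((col_means - grand) ** 2) / (k - 1)
    resid = y - row_means - col_means + grand
    ms_err = np.sum(resid ** 2) / ((n - 1) * (k - 1))
    # ICC(2,1): two-way random effects, absolute agreement, single measurement
    icc = (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n)
    sem = np.sqrt(ms_err)          # standard error of measurement (consistency form)
    sdc = 1.96 * np.sqrt(2) * sem  # smallest detectable change at 95% confidence
    return icc, sem, sdc

# Toy scan-rescan whole-brain volumes (mL) for 5 subjects on one scanner
volumes = np.array([[1180.0, 1182.5],
                    [1235.0, 1231.0],
                    [1090.0, 1094.0],
                    [1320.0, 1318.0],
                    [1150.0, 1154.5]])
icc, sem, sdc = icc_sem_sdc(volumes)
print(f"ICC(2,1)={icc:.3f}  SEM={sem:.2f} mL  SDC={sdc:.2f} mL")
```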

    Interobserver variability studies in diagnostic imaging: a methodological systematic review

    OBJECTIVES: To review the methodology of interobserver variability studies, including current practice and the quality of study conduct and reporting. METHODS: Interobserver variability studies published between January 2019 and January 2020 were included; extracted data comprised study characteristics, populations, variability measures, key results, and conclusions. Risk of bias was assessed using the COSMIN tool for assessing reliability and measurement error. RESULTS: Seventy-nine full-text studies were included, covering various imaging tests and clinical areas. The median number of patients was 47 (IQR: 23–88) and the median number of observers was 4 (IQR: 2–7), with the sample size justified in 12 (15%) studies. Most studies used static images (n = 75, 95%), where all observers interpreted images for all patients (n = 67, 85%). Intraclass correlation coefficients (ICC) (n = 41, 52%), kappa (κ) statistics (n = 31, 39%), and percentage agreement (n = 15, 19%) were the most commonly used measures. Interpretation of variability estimates often did not correspond with study conclusions. The COSMIN risk of bias tool gave a very good/adequate rating for 52 studies (66%), including any studies that used variability measures listed in the tool. For studies using static images, some study design standards were not applicable and did not contribute to the overall rating. CONCLUSIONS: Interobserver variability studies have diverse study designs and methods, the impact of which requires further evaluation. Sample sizes for patients and observers were often small and without justification. Most studies reported ICC and κ values, which did not always coincide with the study conclusions. High ratings were assigned to many studies using the COSMIN risk of bias tool, with certain standards scored 'not applicable' when static images were used. ADVANCES IN KNOWLEDGE: The sample size for both patients and observers was often small without justification. For most studies, observers interpreted static images and did not evaluate the process of acquiring the imaging test, meaning it was not possible to assess many COSMIN risk of bias standards for studies with this design. Most studies reported intraclass correlation coefficients and κ statistics; study conclusions often did not correspond with results.
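    As a minimal illustration of the two most commonly reported measures above, the sketch below computes percentage agreement and Cohen's kappa for two observers rating the same set of images. The ratings are hypothetical and the code is not taken from any of the reviewed studies.

```python
# Minimal sketch (hypothetical data): percentage agreement and Cohen's kappa
# for two observers assigning labels from the same categorical set.
import numpy as np

def percent_agreement(a, b):
    a, b = np.asarray(a), np.asarray(b)
    return np.mean(a == b)

def cohens_kappa(a, b):
    a, b = np.asarray(a), np.asarray(b)
    cats = np.union1d(a, b)
    p_obs = np.mean(a == b)
    # chance agreement from the marginal label frequencies of each observer
    p_exp = sum(np.mean(a == c) * np.mean(b == c) for c in cats)
    return (p_obs - p_exp) / (1 - p_exp)

# Hypothetical reads of 10 images by two radiologists (0 = normal, 1 = abnormal)
obs1 = [0, 1, 1, 0, 1, 0, 0, 1, 1, 0]
obs2 = [0, 1, 0, 0, 1, 0, 1, 1, 1, 0]
print(f"agreement={percent_agreement(obs1, obs2):.0%}  kappa={cohens_kappa(obs1, obs2):.2f}")
```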

    Inter-rater agreement and reliability of the COSMIN (COnsensus-based Standards for the selection of health status Measurement Instruments) Checklist

    Background: The COSMIN checklist is a tool for evaluating the methodological quality of studies on measurement properties of health-related patient-reported outcomes. The aim of this study is to determine the inter-rater agreement and reliability of each item score of the COSMIN checklist (n = 114). Methods: 75 articles evaluating measurement properties were randomly selected from the bibliographic database compiled by the Patient-Reported Outcome Measurement Group, Oxford, UK. Raters were asked to assess the methodological quality of three articles, using the COSMIN checklist. In a one-way design, percentage agreement and intraclass kappa coefficients or quadratic-weighted kappa coefficients were calculated for each item. Results: 88 raters participated. Of the 75 selected articles, 26 articles were rated by four to six participants, and 49 by two or three participants. Overall, percentage agreement was appropriate (68% of items were above 80% agreement), while the kappa coefficients for the COSMIN items were low (61% were below 0.40, 6% were above 0.75). Reasons for low inter-rater agreement were the need for subjective judgement and raters being accustomed to different standards, terminology, and definitions. Conclusions: The results indicated that raters often choose the same response option, but that it is difficult at item level to distinguish between articles. When using the COSMIN checklist in a systematic review, we recommend obtaining some training and experience, having it completed by two independent raters, and reaching consensus on one final rating. Instructions for using the checklist have been improved.
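    For readers unfamiliar with the weighted statistic mentioned above, the sketch below shows one standard way to compute a quadratic-weighted kappa for an ordinal item scored by two raters. The item scores are invented for illustration and the function is not taken from the COSMIN materials.

```python
# Minimal sketch (assumed data): quadratic-weighted kappa for an ordinal
# item scored by two raters on the same set of articles.
import numpy as np

def quadratic_weighted_kappa(r1, r2, n_levels):
    r1, r2 = np.asarray(r1), np.asarray(r2)
    # weight matrix: disagreement penalty grows with the squared score distance
    w = np.array([[(i - j) ** 2 for j in range(n_levels)] for i in range(n_levels)],
                 dtype=float) / (n_levels - 1) ** 2
    # observed and chance-expected contingency tables (as proportions)
    obs = np.zeros((n_levels, n_levels))
    for a, b in zip(r1, r2):
        obs[a, b] += 1
    obs /= len(r1)
    exp = np.outer(np.bincount(r1, minlength=n_levels) / len(r1),
                   np.bincount(r2, minlength=n_levels) / len(r2))
    return 1 - (w * obs).sum() / (w * exp).sum()

# Hypothetical item scores (0=poor, 1=fair, 2=good, 3=excellent) from two raters
rater1 = [3, 2, 2, 1, 0, 3, 2, 1]
rater2 = [3, 2, 1, 1, 0, 2, 2, 2]
print(f"weighted kappa = {quadratic_weighted_kappa(rater1, rater2, 4):.2f}")
```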

    Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist

    Background: The COSMIN checklist is a standardized tool for assessing the methodological quality of studies on measurement properties. It contains 9 boxes, each dealing with one measurement property, with 5–18 items per box about design aspects and statistical methods. Our aim was to develop a scoring system for the COSMIN checklist to calculate quality scores per measurement property when using the checklist in systematic reviews of measurement properties. Methods: The scoring system was developed based on discussions among experts and testing of the scoring system on 46 articles from a systematic review. Four response options were defined for each COSMIN item (excellent, good, fair, and poor). A quality score per measurement property is obtained by taking the lowest rating of any item in a box ("worst score counts"). Results: Specific criteria for excellent, good, fair, and poor quality for each COSMIN item are described. In defining the criteria, the "worst score counts" algorithm was taken into consideration. This means that only fatal flaws were defined as poor quality. The scores of the 46 articles show how the scoring system can be used to provide an overview of the methodological quality of studies included in a systematic review of measurement properties. Conclusions: Based on experience in testing this scoring system on 46 articles, the COSMIN checklist with the proposed scoring system seems to be a useful tool for assessing the methodological quality of studies included in systematic reviews of measurement properties. © The Author(s) 2011
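    The "worst score counts" rule described above lends itself to a very short implementation. The sketch below is a hypothetical illustration: the box and item names are made up and do not reproduce the actual COSMIN items.

```python
# Minimal sketch (hypothetical item names): the "worst score counts" rule, where
# the quality score of a COSMIN box is the lowest rating of any item in that box.
RANK = {"poor": 0, "fair": 1, "good": 2, "excellent": 3}

def box_score(item_ratings):
    """item_ratings: dict mapping item name -> 'excellent'/'good'/'fair'/'poor'."""
    return min(item_ratings.values(), key=lambda r: RANK[r])

reliability_box = {
    "sample size": "good",
    "time interval stated": "excellent",
    "statistical method": "fair",   # one fair item caps the whole box at fair
}
print(box_score(reliability_box))   # -> 'fair'
```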

    The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study

    BACKGROUND: The aim of the COSMIN study (COnsensus-based Standards for the selection of health status Measurement INstruments) was to develop a consensus-based checklist to evaluate the methodological quality of studies on measurement properties. We present the COSMIN checklist and the agreement of the panel on the items of the checklist. METHODS: A four-round Delphi study was performed with international experts (psychologists, epidemiologists, statisticians, and clinicians). Of the 91 invited experts, 57 agreed to participate (63%). Panel members were asked to rate their (dis)agreement with each proposal on a five-point scale. Consensus was considered to be reached when at least 67% of the panel members indicated 'agree' or 'strongly agree'. RESULTS: Consensus was reached on the inclusion of the following measurement properties: internal consistency, reliability, measurement error, content validity (including face validity), construct validity (including structural validity, hypotheses testing, and cross-cultural validity), criterion validity, responsiveness, and interpretability; the latter was not considered a measurement property. The panel also reached consensus on how these properties should be assessed. CONCLUSIONS: The resulting COSMIN checklist could be useful when selecting a measurement instrument, peer-reviewing a manuscript, designing or reporting a study on measurement properties, or for educational purposes. This study was financially supported by the EMGO Institute for Health and Care Research, VU University Medical Center, Amsterdam, and the Anna Foundation, Leiden, The Netherlands.
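    The consensus rule described above (at least 67% of panel members rating 'agree' or 'strongly agree' on a five-point scale) can be illustrated in a few lines; the panel ratings below are invented.

```python
# Minimal sketch (made-up ratings): a proposal reaches consensus when at least
# 67% of panel members rate it 4 ('agree') or 5 ('strongly agree').
def reached_consensus(ratings, threshold=0.67):
    agree = sum(1 for r in ratings if r >= 4)
    return agree / len(ratings) >= threshold

panel_ratings = [5, 4, 4, 3, 5, 4, 2, 5, 4, 4]   # 8 of 10 agree -> 80%
print(reached_consensus(panel_ratings))           # True
```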

    Preregistering Qualitative Research: A Delphi Study

    Preregistrations (records made a priori about study designs and analysis plans and placed in open repositories) are thought to strengthen the credibility and transparency of research. Different authors have put forth arguments in favor of introducing this practice in qualitative research and made suggestions for what to include in a qualitative preregistration form. The goal of this study was to gauge and understand which parts of preregistration templates qualitative researchers would find helpful and informative. We used an online Delphi study design consisting of two rounds with feedback reports in between. In total, 48 researchers participated (response rate: 16%). In round 1, panelists considered 14 proposed items relevant to include in the preregistration form, but two items had relevance scores just below our predefined criterion (68%) with mixed arguments and were put forward again. We combined items where possible, leading to 11 revised items. In round 2, panelists agreed on including the two remaining items. Panelists also converged on the suggested terminology and elaborations, except for two terms for which they provided clear arguments. The result is an agreement-based form for the preregistration of qualitative studies that consists of 13 items. The form will be made available as a registration option on the Open Science Framework (osf.io). We believe it is important to ensure that the strength of qualitative research, which is its flexibility to adapt, adjust, and respond, is not lost in preregistration. The preregistration should provide a systematic starting point.

    The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: A clarification of its content

    Background: The COSMIN checklist (COnsensus-based Standards for the selection of health status Measurement INstruments) was developed in an international Delphi study to evaluate the methodological quality of studies on measurement properties of health-related patient-reported outcomes (HR-PROs). In this paper, we explain our choices for the design requirements and preferred statistical methods for which no evidence is available in the literature or on which the Delphi panel members had substantial discussion. Methods: The issues described in this paper are a reflection of the Delphi process in which 43 panel members participated. Results: The topics discussed are internal consistency (relevance for reflective and formative models, and distinction from unidimensionality), content validity (judging relevance and comprehensiveness), hypotheses testing as an aspect of construct validity (specificity of hypotheses), criterion validity (relevance for PROs), and responsiveness (concept and relation to validity, and (in)appropriate measures). Conclusions: We expect that this paper will contribute to a better understanding of the rationale behind the items, thereby enhancing the acceptance and use of the COSMIN checklist.

    Key learning outcomes for clinical pharmacology and therapeutics education in Europe: A modified Delphi study.

    Harmonizing clinical pharmacology and therapeutics (CPT) education in Europe is necessary to ensure that the prescribing competency of future doctors is of a uniformly high standard. As there are currently no uniform requirements, our aim was to achieve consensus on key learning outcomes for undergraduate CPT education in Europe. We used a modified Delphi method consisting of three questionnaire rounds and a panel meeting. A total of 129 experts from 27 European countries were asked to rate 307 learning outcomes. Of these, 92 experts (71%) completed all three questionnaire rounds, and 33 experts (26%) attended the meeting. In total, 232 learning outcomes from the original list, 15 newly suggested outcomes, and 5 rephrased outcomes were included. These 252 learning outcomes should be included in undergraduate CPT curricula to ensure that European graduates are able to prescribe safely and effectively. We provide a blueprint of a European core curriculum describing when and how the learning outcomes might be acquired.